- Home
- Search Results
- Page 1 of 1
Search for: All records
- 
                                    Total Resources1
- Resource Type
- 
                                    
                                    
                                    
                                    0001000000000000
- More
- Availability
- 
                                    
                                    01
- Author / Contributor
- Filter by Author / Creator
- 
                                    
                                        - 
                                                    
                                                        
                                                            
                                                            Geng, S (1)
- 
                                                    
                                                        
                                                            
                                                            Ivison, H (1)
- 
                                                    
                                                        
                                                            
                                                            Koh, P W (1)
- 
                                                    
                                                        
                                                            
                                                            Krishna, R (1)
- 
                                                    
                                                        
                                                            
                                                            Li, C_L (1)
- 
                                                    
                                                        
                                                            
                                                            Li, J (1)
- 
                                                    
                                                        
                                                            
                                                            Sap, M (1)
- 
                                                    
                                                        
                                                            
                                                            #Tyler Phillips, Kenneth E. (0)
- 
                                                    
                                                        
                                                            
                                                            #Willis, Ciara (0)
- 
                                                    
                                                        
                                                            
                                                            & Abreu-Ramos, E. D. (0)
- 
                                                    
                                                        
                                                            
                                                            & Abramson, C. I. (0)
- 
                                                    
                                                        
                                                            
                                                            & Abreu-Ramos, E. D. (0)
- 
                                                    
                                                        
                                                            
                                                            & Adams, S.G. (0)
- 
                                                    
                                                        
                                                            
                                                            & Ahmed, K. (0)
- 
                                                    
                                                        
                                                            
                                                            & Ahmed, Khadija. (0)
- 
                                                    
                                                        
                                                            
                                                            & Aina, D.K. Jr. (0)
- 
                                                    
                                                        
                                                            
                                                            & Akcil-Okan, O. (0)
- 
                                                    
                                                        
                                                            
                                                            & Akuom, D. (0)
- 
                                                    
                                                        
                                                            
                                                            & Aleven, V. (0)
- 
                                                    
                                                        
                                                            
                                                            & Andrews-Larson, C. (0)
 
- 
                                                    
                                                        
                                                            
                                                            
- Filter by Editor
- 
                                    
                                        - 
                                                    
                                                        
                                                            
                                                            & Spizer, S. M. (0)
- 
                                                    
                                                        
                                                            
                                                            & . Spizer, S. (0)
- 
                                                    
                                                        
                                                            
                                                            & Ahn, J. (0)
- 
                                                    
                                                        
                                                            
                                                            & Bateiha, S. (0)
- 
                                                    
                                                        
                                                            
                                                            & Bosch, N. (0)
- 
                                                    
                                                        
                                                            
                                                            & Brennan K. (0)
- 
                                                    
                                                        
                                                            
                                                            & Brennan, K. (0)
- 
                                                    
                                                        
                                                            
                                                            & Chen, B. (0)
- 
                                                    
                                                        
                                                            
                                                            & Chen, Bodong (0)
- 
                                                    
                                                        
                                                            
                                                            & Drown, S. (0)
- 
                                                    
                                                        
                                                            
                                                            & Ferretti, F. (0)
- 
                                                    
                                                        
                                                            
                                                            & Higgins, A. (0)
- 
                                                    
                                                        
                                                            
                                                            & J. Peters (0)
- 
                                                    
                                                        
                                                            
                                                            & Kali, Y. (0)
- 
                                                    
                                                        
                                                            
                                                            & Ruiz-Arias, P.M. (0)
- 
                                                    
                                                        
                                                            
                                                            & S. Spitzer (0)
- 
                                                    
                                                        
                                                            
                                                            & Sahin. I. (0)
- 
                                                    
                                                        
                                                            
                                                            & Spitzer, S. (0)
- 
                                                    
                                                        
                                                            
                                                            & Spitzer, S.M. (0)
- 
                                                    
                                                        
                                                            
                                                            (submitted - in Review for IEEE ICASSP-2024) (0)
 
- 
                                                    
                                                        
                                                            
                                                            
- 
                                    Have feedback or suggestions for a way to improve these results?
 !
                                    
                                        
                                            Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
                                            Some full text articles may not yet be available without a charge during the embargo (administrative interval).
                                        
                                        
                                        
                                            
                                                
                                             What is a DOI Number?
                                        
                                    
                                
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
- 
            Improvements in language models are often driven by improving the quality of the data we train them on, which can be limiting when strong supervision is scarce. In this work, we show that paired preference data consisting of individually weak data points can enable gains beyond the strength of each individual data point. We formulate the delta learning hypothesis to explain this phenomenon, positing that the relative quality delta between points suffices to drive learning via preference tuning--even when supervised finetuning on the weak data hurts. We validate our hypothesis in controlled experiments and at scale, where we post-train 8B models on preference data generated by pairing a small 3B model's responses with outputs from an even smaller 1.5B model to create a meaningful delta. Strikingly, on a standard 11-benchmark evaluation suite (MATH, MMLU, etc.), our simple recipe matches the performance of Tulu 3, a state-of-the-art open model tuned from the same base model while relying on much stronger supervisors (e.g., GPT-4o). Thus, delta learning enables simpler and cheaper open recipes for state-of-the-art post-training. To better understand delta learning, we prove in logistic regression that the performance gap between two weak teacher models provides useful signal for improving a stronger student. Overall, our work shows that models can learn surprisingly well from paired data that might typically be considered weak.more » « lessFree, publicly-accessible full text available July 8, 2026
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
